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Lactobacillus fermentum is a normal inhabitant of the human gastrointestinal tract. Here, we report the draft genome sequence 
of an Indian isolate of the probiotic strain L. fermentum Lfl, isolated from the human gut. 
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Lactobacillus fermentum, a heterofermentative species of the ge- 
nus Lactobacillus, is a normal inhabitant of the human gastro- 
intestinal tract and possesses probiotic attributes other than its 
commercial uses in food and feed fermentation. The complete 
whole-genome sequences of strains L. fermentum IFO 3956 and 
L. fermentum CECT 5716, a probiotic strain from human miHc, 
have been determined (1, 2). Recently, Maldonado et al. (3) re- 
ported that strain CECT 5716 may be useful for the prevention of 
community-acquired gastrointestinal and upper respiratory in- 
fections. L. fermentum strains E-3 and E-18 have also been re- 
ported to possess strong antioxidative properties (4). 

L. fermentum Lfl, an indigenous isolate from the human gut, 
was identified using 16S rRNA (accession no. KC509914) and 
rpoA (accession no. KC509912). It has been deposited in the In- 
ternational Depository Budapest Treaty at the Microbial Type 
Culture Collection (MTCC) as strain MTCC 5689. Lfl possesses 
probiotic attributes, viz. in acid and bile tolerance and autoaggre- 
gation ability, and it inhibits gastrointestinal pathogens. It has also 
been found to upregulate the transcription factor Nrf2 and inhib- 
its lipid peroxidation (R. Chauhan, V. K. Batish, and S. Grover, 
unpublished data). The functional efficacy of Lfl in terms of its 
anti-inflammatory and antioxidative properties was demon- 
strated in a dextran sodium sulfate (DSS)-induced colitis mouse 
model. The disease activity index (DAI) and histological scores of 
the colitis mouse group fed with Lfl improved significantly com- 
pared to those in colitis mice, and the Lfl -fed group prevented 
lipid peroxidation. Since Lfl might be an ideal probiotic, its 
genomic sequence was deciphered. 

Genome sequencing was performed using lUumina Solexa ge- 
nome analyzer (HiScanSQ genome analyzer; lUumina, Inc.). A 
total of 7,665,410 high-quality reads were obtained after filtering 
with a Phred score of <20, and these were used for alignment with 
L. fermentum IFO 3956 (accession no. NC_010610.1) as the refer- 
ence genome. Of the total reads, 83.50% were aligned with the 
reference genome, with 86.90% genome coverage and total gap 
length of 0.27 Mb. A genome comparison of Lfl with that of L. 



fermentum IFO 3956 revealed 9,210 single-nucleotide polymor- 
phisms (SNPs), which were mapped to 653 genes. 

The draft genome of Lfl was assembled into a single circular 
chromosome of 2,098,685 bp using the reference L. fermentum 
IFO 3956 sequence using the Burrows-Wheeler Aligner (BWA) 
(5) to insert Ns to fill the gaps. The G+C content of Lfl is 52.5%, 
and 1,620 predicted protein-coding genes were identified by using 
Glimmer 3.02 (6), followed by manual curation. A total of 1,233 
(76.1%) genes were annotated with known functions using 
BLASTp against the NCBI nonredundant (NR) and Clusters of 
Orthologous Groups (COG) databases (7). Three hundred eighty- 
seven (23.9%) genes were annotated as hypothetical, conserved 
hypothetical, or unnamed protein products. The Lfl strain con- 
tains five rRNA gene operons (5S, 16S, and 23S rRNAs) and 57 
tRNA genes in the genome. A total of 1,389 genes were classified 
into 20 functional COGs, of which the most abundant classes were 
replication, recombination, and repair (18%), amino acid trans- 
port and metabolism (13%), and transcription (11%). 

Nucleotide sequence accession numbers. The L. fermentum 
Lfl whole-genome shotgun project has been deposited at DDBI/ 
EMBL/GenBank under the accession no. AWXSOOOOOOOO. The 
version described in this paper is version AWXSOIOOOOOO. 
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